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Abstract 

Scientists investigate the dynamics of complex systems with quantitative models, 
employing them to synthesize knowledge, to explain observations, and to forecast 
future system behavior. Complete specification of systems is impossible, so models 
must be simplified abstractions. Thus, the art of modeling involves deciding which 
system elements to include and determining how they should be represented. We 
view modeling as search through a space of candidate models that is guided by 
model objectives, theoretical knowledge, and empirical data. In this contribution, 
we introduce a method for representing process-based models that facilitates the dis- 
covery of models that explain observed behavior. This representation casts dynamic 
systems as interacting sets of processes that act on entities. Using this approach, a 
modeler first encodes relevant ecological knowledge into a library of generic entities 
and processes, then instantiates these theoretical components, and finally assembles 
candidate models from these elements. We illustrate this methodology with a model 
of the Ross Sea ecosystem. 
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1 Introduction and Motivation 



Models are critical tools for determining how elements combine to gener- 
ate the complex system dynamics that we observe in nature. Ecologists use 
models to synthesize existing syste m knowledge into a concise form that 



guide s empirical research programs ([Osidele and Beckl . l2004t IWhipple et al. 



2005 ), but they als o use statistical models to describe patterns in their data 
f)Underwood L 1997 ) , and simulation mode l s both to explain and predict system 



behavior AC 



Melillo et al. 



ark et all 12001 : Fordl 2000t Jorgensen and Bendoricchiol . 2001 . 
19931). Models also let scientists perform thought experiments 



that would not otherwise be possible or ethical. Becau se of thi s advantage, 
ecologists have used models to bu i ld eco l ogical theory (lAbramsl Il99"l |200CA 
Carpenter et al. . 19851 : Jorgensen . 2002: iPulliam and Da nielson. 1991) and 



to guide environmental assessment and management ( Brando et al 



Costanza and Ruth L 19981: Jorgensenl . 1994 ; Korfmacher , 2001 : iMaguird . 
Reckhowl . 11994 ISage et all 120031) . 



2004; 



2003; 



Quantitative models in ecology and environmental science are often catego- 
rized by the degree to which the i r structure corresponds to a real system 



nzed by tne degree to wnicn tne i r structure c orresponds to a real system 
flBosseill992tlHilborn and Mangel ll997tlLevinsl . ll96dll993[l0rzack and Sober 
19931 iReckhowl . Il994t Zeigler, Il974l ). This realism continuum begins with em- 
pirical models and ends with mechanisms. Empirical models (e.g., regression- 
based models) stem solely from observed relationships among variables, pro- 
vide a statistical summary of the data, and ignore mechanisms determining 
the behavior. In contrast, mechanistic models contain unobserved relation- 
ships, explain system dynamics, and emphasize the physical, chemical, and 
biological processes that generate system behavior. Ecologists use mechanistic 
models to understand how system behavior may change in response to chang- 
ing environmental conditions. A central problem of building models with more 
realistic structures is determining which entities and processes to include and 
which mathematical representation is most appropriate. 



Following lLanglev et al.l (J1987), we claim that model construction involves 
search through a space of possible models for ones that fit system observa- 
tions. This space contains alternative model structures (entities, processes 
and the connections among them), mathematical formulations, and parame- 
ter values. The immense number of possible models challenges scientists, who 
navigate this space by selecting the set of objects or entities used to repre- 
sent the system and the relationships that link these objects to each other 
and their environment. When establishing the entities, scientists must address 
three questions: (1) which entities should be included, (2) how detailed should 
they be, 3) and how should they be represented? The answers determine ob- 
ject aggregations and system boundaries, bo th of which are rarely obvious 
and can significantly influence model results ( Abarca- Arenas and Ulanowiczl . 
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2002llAhl and Allenl.ll996l:ICale and Odelill979tlGardner et al.Lll982l : lLoehlel . 
1987al : lRastetter et al.Lll992h . After denning the objects, the scientist can state 



their relationships by deciding which ones to include and how to model them. 
This task requires the specification of each relationship's mathematical formu- 
lation, which often involves selection from several possibilities. For example, 
the Lotka-Volterra, Ivlev, or Holling Type III functions each model predation, 
but the different formulations make different claims about how the process op- 
erates. Finally, the scientist must set the numeric parameters. In some cases, 
empirical estimat es of parameter values exist , but more often the values are 
largely unknown ( Beck L 1987l: Reckhow . 1994). Uncertainty enters the model 

ieckl.ll 



at each decision point ( Beck. Il987l:lj0rgensen and Bendoricchicl . 2001 : Loehld . 



1987al iReckSwI . ll994lRevnolds and FordL Il999h .' complicating the search for 



plausible explanations. 



Searching for models presents two additional challenges, one related to the 
selection criteria and the other centered on the s earch procedure. Scientists 
require criteria for ranking and evaluating models (iHilborn and Mangell. 1*1 997; 



Jost and Arditi , 2001 ; Oresked . 19981: Oreskes et al. , 11994 iRevnolds and Ford , 
1999^ . which usually include one or more quantitative measures of goodness-of- 



fit of the predictions to observed data. However, selecting a model based on its 
accuracy alone is insufficie nt since very different models can generate similar 
behavior ( Cale et al. . 19891 ). To overcome this problem, additional criteria such 
as a model's complexity, uncertainty and generality may be used. If the model 
should e xplain system behavior, then its structure must also be sufficiently 
realistic (Bossel, 19921 : Levinsl . 19661 : Zeigler . 1974 ). The second challenge is 
that the search procedure is cumbersome. In ecology, this search is typically a 
manual effort guided by an expert's domain knowledge (e.g., aquatic ecosys- 
tems) and modeling experience, along with the data. Given a model structure 
and mathematical formulation, some methodolog ies assist with fitting parame- 
ters and quantifying parameter uncertainty (e.g., Hornberger and Spear . 1980: 



tcrs and quantilymg parameter uncertainty e.g., riornberger and bpcar, 1 
Osidele and BeckL 12004 ISaltelli et allkOOOHSpear and Hornbergerl .ll980). 



Nev- 
ertheless, search through the space of model structures, mathematical formula- 
tions, and parameter values remains a challenging and time-consuming chore. 



In this paper we describe a method for representing and building models de- 
signed to: 

• facilitate construction of process-based models; 

• expedite search through the space of candidate model structures; 

• root model development in domain theory; and 

• bind models to empirical observations. 

We first describe a new formalism for representing models as interacting sets 
of entities and processes. We claim that this formalism captures how scientists 
understand complex system dynamics, and therefore eases model communica- 
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tion. This representation also simplifies the comparison of a model's structure 
to the relevant domain theory. We then show how this formalism facilitates 
search through the space of plausible model structures and illustrate the use 
of this approach by re-representing an existing model of the Ross Sea. Finally, 
we discuss related work and propose some directions for future research. 



2 Process Modeling 

In this section, we introduce the method for representing and constructing 
explanatory models. The approach has two core elements: entities, which are 
the objects or actors in the system, and processes, which are the actions or 
activities of the entities that generate system dynamics. Abstract forms of 
processes and entities encode domain knowledge, which is then used to con- 
struct models with realistic structure. Scientists combine instantiated versions 
of the abstract elements to construct models of specific systems. We conclude 
this section by briefly describing software we are developing to support this 
modeling approach. 

2. 1 Entities 

In process models, entities are actors and receivers of action that are charac- 
terized by a combination of variables and parameters. For example, in a soil 
ecosystem model the collection of nematodes could be an entity with variables 
that describe its total carbon concentration and the number of individual or- 
ganisms. Depending on the model objectives and the processes included, a 
variety of parameters associated with the nematodes may be of interest in- 
cluding their maximum intrinsic growth rate, carrying capacity, and death 
rate. 

In ecological models, entities are rarely differentiated from variables, which 
works well when there is only one state variable for each entity. In such cases, 
ecologists commonly associate parameters with an entity by using the state 
variable's name as a subscript (see Section 3 for an example). However, making 
entities explicit provides a natural way to group variables and parameters and 
more closely resembles how scientists think about real systems. 

To make entities explicit in the process modeling representation, a scientist 
specifies a set of generic entity types, each of which defines the properties of a 
class of objects. Multiple instantiated versions of these generic entities can be 
included in a model. For example, an ecologist could create a generic entity for 
birds, and then instantiate it as a sparrow or a hawk by changing the values 
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Table 1 

The syntax for defining a generic entity type. 



entity name 
description "description" 0, 
variables 

variable_name\ {combining scheme} , "description" 
variablejname2 {combining scheme}, "description" 
parameters 

parameter jnamei [range] , "description" 
parameter _name2 [range], "description" 

"Descriptions can be any text. 

^Combining schemes state how the effects of multiple processes operating on a variables will be aggregated. 
c Parameter ranges delineate legal values, which arc determined from mathematical constraints, domain 
theory, and empirical observations. 

of class properties. 



Todorovski et all (|2005h report an initial formalism for specifying entities in 



process models that we build upon here in Table 1. The formal generic entity 
has a name and a set of properties. Entities may contain both variables and 
parameters, where variables change in the context of the model and parameters 
do not. Variables must have a name and a rule that determines how the effects 
of multiple processes are aggregated (e.g., summed, multiplied). Parameters 
must have a name and an interval that delineates their possible values. For 
both variables and parameters, there is an optional slot to provide a brief 
description. In instantiated entities the variables are either associated with 
data or given initial values, the parameter values are assigned real values, and 
a field following the name indicates the parent generic entity. 

To use an entity in a model, we need a notation that allows access to its fields. 
We have chosen a dot notation that concatenates the variable or parameter 
name with the entity name. For example, to refer to the chlorophyll a con- 
centration (chla) of an instantiated phytoplankton entity (Phaeocystis), we 
write chla. Phaeocystis. 



2.2 Processes 



Processes are the physical, chemical, or biological actions that drive change 
in dynamic models. For example, growth is a biological process that occurs in 
many ecological models, whereas oxidation-reduction, photolysis, and sorp- 
tion are examples of chemical and physical processes from biogeochemistry. 
The process modeling framework employs two forms of processes: generic and 
instantiated. Table 2 shows the syntax for generic processes, which define the 
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Table 2 

The syntax for defining a generic process. 



generic process name 
relates rolei {entity type, entity type}" 

role2 {entity type, entity type} 
parameters <set of process specific parameters> 
equations <set of differential and algebraic equations > b 
conditions <set of boolean conditions> c 

"Each role may be played by multiple objects instantiated from different generic entities. 
ft Equat ions describe the effects of the process on the entities. 
c Conditions control when the process is active. 

basic properties of a class of processes. Generic processes must include a name, 
a statement of which generic entities or entity types can be involved, and a set 
of equations. The relates statement identifies unique entity roles in the process 
and the entity types that can fill those roles. A generic process can also include 
a set of Boolean conditions that control whether the process is inactive. For 
example, we could set the conditions so that the variable light must be pos- 
itive for the photosyntheses process to occur {light. environment > 0). Such 
statements turn processes on and off, making the model structurally dynamic. 
The set of generic processes are collected with the generic entities into one 
library. 

The second form of process is the instantiated version of a generic process, 
which is bound to specific entities and has specific values for parameters. 
Each instantiated process has a specific name and instantiated entities fill 
the roles in the equations. The process naming scheme first identifies the 
process and then the entities the process affects. For example, we might 
name an exponential growth process that operates on a phytoplankton en- 
tity 'exp_growth_phyto'. 

Table 3 illustrates the difference between instantiated and generic processes. 
Both representations refer to an exponential death process. The generic pro- 
cess specifies two roles that entities play in this process. The first role, played 
by the organism that dies, can be filled by either the phytoplankton or zoo- 
plankton entity types. The equations indicate that these types must contain 
a variable called cone and a parameter named dr. The second role can only 
be filled by the detritus entity type, which must also have a cone variable. 
In the instantiated process, the name of the instantiated entity replaces the 
role name in the equations. Thus, the first role was filled by a phytoplankton 
entity named Phaeocystis, whereas the second role was filled by the detritus 
entity named Detritus. 

A key distinction of this representation is that elements of the model equations 
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Table 3 

An example of a generic process for exponential death and an instantiated version 
of the process. 

Generic Process 

generic process death_exponential 
relates £l{phytoplankton, zooplankton}, S2{detritus} 
equations d[conc.51,t,l] = -1 * dr. SI * conc.Sl 
d[conc.S2,t,l] = dr. SI * conc.Sl 

Instantiated Process 

process death_exponential_Phaeo 
equations d[conc.Phaeocystis,t,l] = -1 * dr.Phaeocystis * conc.Phaeocystis 
d[conc.Detritus,t,l] = dr.Phaeocystis * conc.Phaeocystis 



that pertain to a process are grouped together. This organization communi- 
cates how a process is formulated, what entities drive it, and how it affects 
particular entities. This process centered association also facilitates the assem- 
bly of the system of equations for a model from these fragments. 



2.3 Modeling Procedure 



The modeling procedure described here resem bles the traditional recipe found 



in many texts on quantitative mo deling (e.g., iGoldl Il977t iGrant et all 11997 



Jeirgensen and Bendoricchiol 2001 ). but it bears two distinctive features. First 



the procedure is organized as a search through a space of candidate models 
for those that explain a set of empirical observations. Second, the method 
assembles candidate models from libraries of generic entities and processes. To 
implement this approach, we must define the search space, describe a search 
mechanism, and specify criteria for selecting among alternative models. In 
this paper we assume that the structural search is exhaustive, the parameter 
search is accomplished using existing algorithms such as gradient descent, 
and that the selection criteria include quantitative measures of goodness-of-fit 
(e.g., sum of squared errors, r 2 ) and the realism of the model structure. These 
assumptions allow us to focus our discussion on the search space. 

As described in the introduction, the search space is determined by alternative 
model structures and sets of parameters. While this space can be quite large, 
our method of representing and developing models constrains it in two ways. 
First, a model's structure comprises entities linked by processes. The space 
of possible model structures is circumscribed by a domain-specific library of 
generic entities and processes thought to occur in the system, and specific 
entities to be related. The generic types dictate how instantiated processes and 
entities can be combined. Second, the entity types specify the possible ranges 
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for each parameter, limiting the parameter space. The range defined in the 
generic entity may be wide if we have little knowledge about the appropriate 
values, but additional knowledge, such as empirical estimates, can support a 
narrower range. 



In the ideal case, one or more libraries of generic entities and processes will 
already exist for the domain being investigated. In that case, the challenge 
is to select the most relevant library and adapt it as needed. For example, it 
will be necessary to adapt a library when observations of the physical system 
suggest alternative entity definitions, or when the modeling objective is to 
determine the significance of a conjectured new process. 



If no appropriate library exists, then an investigator must construct one to 
support the modeling task. Although this requirement introduces an addi- 
tional step to the modeling task, we believe that the advantages of using an 
explicit library of entities and processes outweigh the cost of its construc- 
tion. Specifically, the generic entities and processes define the search space. 
From the generic definitions, the library determines the possible entities and 
processes the model can include, their possible structural relationships, alter- 
native mathematical formulations, and the parameter ranges. Further, to the 
degree that the library reflects existing domain knowledge, it also determines 
the space of theoretically plausible or realistic models. In addition, a key idea 
in this approach is that these libraries form a repository of domain knowledge 
or theory that can be reused for different modeling projects. We expect this to 
both expedite the modeling process and constrain model structures to reflect 
existing domain knowledge. At the very least, we expect it will ease review of 
the domain theory and modeling assumptions used. 



Given the space of plausible models, the investigator must search through it 
for models that explain the empirical observations. This search has three steps: 



(1) bind entities to selected generic processes and combine these components 
into a model structure; 

(2) use estimation routine to identify parameters that generate predictions 
that best fit the observed variables; and 

(3) evaluate the resultant model's performance using a set of model selection 
criteria. 



In this way, multiple alternative models are constructed and evaluated. This 
model search can be implemented by hand, but the formalism also allows 
both the structure and parameter search to be automat ed. Automati c mode l 
inductio n is b e yond the scope of th i s paper, but s ee George et al. ( 2003j ); 
Landevi (|2000h : iLanriev et all (|2003l 120021 lin presd ) for details on how this 



can be accomplished. 
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2.4 Prometheus 



To encourage the process modeling approach, we are developing a software 

environment called Prometheus that is designed to support m odel building 

from conceptual development to evaluation, use, and publication ((Sanchez and Langlev . 
This program has six principal features. First, it provides an interactive 



way for a user to add and edit generic processes to a domain library. Second, 
it allows manual construction of instantiated models from a domain library 
in an interactive and graphic environment. For example, to add a process the 
user selects a generic process from the library, which opens a dialogue box that 
guides the user through binding the entity roles and parameters in the generic 
process to specific entities in the model. The user can also specify the values 
of process level parameters in this dialogue. Given an instantiated model, a 
third feature of the program is to assemble and simulate the model equations. 
The user can then view the equations and graph the simulated trajectories. 
The fourth feature is Prometheus' ability to automatically search through 
candidate models derived from a library. The program returns a reduced set of 
the models with trajectories that best match the observed data based on sum 
of squared errors. A fifth feature is the ability to automatically revise an ex- 
isting model. Through an interactive dialogue, the user can ask Prometheus 
to add or delete whole p rocesses, and to revise parameters within processes. 



Asgfiarbevgi et al.l ([20061 ) discuss this ability in more detail. The final feature 



is a graphical representation of the model structure in which entity variables 
are linked through processes. This representation of instantiated models pro- 
vides another way for users to view and analyze their models (see Section 3.2.3 
for more details on the model diagrams). Collectively, these features provide 
a comprehensive toolbox for process modeling. 



3 Ross Sea Ecosystem Model 



In this section, we illustrate the process modeling approach with a version of 
the Ross Sea ec osystem embedded i n the Ross Sea Coup l ed Ice And Ocean 
(CIAO) model (|Arrigo et all . 120031: IWorthen and Arrigol . |2004). The origi- 
nal ecosystem model has six state variables, including P. antarctica (Pi), 
diatoms (-P2), detritus (D), zooplankton (Z), nitrate (NO3) and iron (Fe). 
In CIAO, this ecosystem model is coupled to a modified version of the Prince- 
ton Oceanographic Model, which is a physical ocean model that predicts 
the three dimensional structure of temperature, salinity, and velocity in re- 
sponse to b oth surface and la teral boundary conditions of heat, salt, and 



momentum. 



Arrigo et al. ( 2003? ) used CIAO to investigate factors controlling 



phytoplankton production and taxonomic composition in the Ross Sea, and 
Worthen and Arrigol (|2004j ) used it to explore the interannual variability of 
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primary production. More recently, iTagliabue and Arrig added more 



detailed mechanisms for iron cycling, the carbonate system, and the air-sea 
CO2 exchange. They used this enhanced model to investigate the sensitivity 
of the ecosystem dynamics to changes in taxon-specific nutrient utilization pa- 
rameters. They also explored the impact of diffe ring iron fertilization regime s 
on the abilit y of the system to sequeste r carbon ( Arrigo and TagliabueL 20051 ). 



In addition, Asgharbevgi et al. ( 20061 ) employed a modified version of the 



ecosystem model to demonstrate the usefulness of new inductive tools for 
model revision. 

To more clearly demonstrate the process modeling approach, we simplified 
the Ross Sea ecosystem model in two ways. First, we extracted the ecosys- 
tem model from CIAO, and focused on activity near the ocean surface. In 
this simplified model, there are three driving variables: daily mean photosyn- 
thetically usable radiation at the ocean surface (PUR e ) and center (PUR C ) 
of the modeled volume and water temperature {Th 2 o)- Second, we represent 
phytoplankton with one state-variable (P) rather than two. The variables C, 
Z, and D represent the concentration of carbon in the three entities, and NO% 
and Fe represent the concentration of nitrate and iron, respectively. 



3.1 Differential and Algebraic Equation Representation 



As is common in ecological modeling, the original Ross Sea ecosystem model 
was originally formulated as a set of ordinary differential and algebraic equa- 
tions, which were implemented as a Fortran program. The modified differential 
equations for the five state- variables are 



dP 

~dt 
dZ 

~dl 
dD 

~~dt 
dN0 3 

dt 

dFe 



dt 



((l — ep)ti — xp-Wp)P — gZ, (1) 

(jz9 -x z - r z )Z, (2) 

(1 - f3) [(1 - lz )gZ + x P P + x z Z] - (r D + co D )D, (3) 

~4>(p,n/c)^P, and (4) 

4>(D,Fe/C)rDD - <j)(P,Fe/N)VP- (5) 



In equation 1, the balance between the rates of growth (//), death (xp), ex- 
udation (ep), grazing (g), and sinking (up) determine the change in phyto- 
plankton concentration (P). The product of the grazing rate (g), the zoo- 
plankton assimilation efficiency (jz), an d the phytoplankton being grazed, as 
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well as the zooplankton death (xz) and respiration (rz) rates determine the 
changes in zooplankton (Z) abundance as shown in equation 2. Additions to 
the detrital (D) pool, modeled in equation 3, include the particulate fractions 
(1 — P) of unassimilated grazing (i.e., fecal pellets) and dead phytoplankton 
and zooplankton. Losses are from remineralization (t\d) and sinking (u>d)- Phy- 
toplankton nitrate uptake is assumed to be proportional to the magnitude of 
phytoplankton growth (/i-P), and determines the Nitrate (NO3) concentration 
as equation 4 illustrates. We then multiply this quantity by the phytoplank- 
ton nitrogen to carbon ratio {4>{p,n/c))- There are no significant nitrate inputs 
during the growing season. In equation 5, iron concentration (Fe) is a function 
of detrital remineralization (adjusted by the Fe/C ratio, <f)(D,Fe/c)) and phyto- 
plankton production multiplied by the phytoplankton Fe/C ratio ((f>(p t Fe/c))- 

Phytoplankton growth rate (//) and zooplankton grazing rate (g) are variables 
in this model. We formulated the phytoplankton growth rate as 

/i = fiO max ■ e (°- 06933 - T H 2 o) . m i n (ikJim, nJim, feJim), (6) 

where nO max is the maximum unlimited growth rate at 0°C and Th 2 o is the 
water temperature. Light and nutrients limit growth as if they were substi- 
tutable resources. That is, only the minimum value of these resources retards 
the maximum growth rate. We calculated light limitation (ikJim) as 

ikJim = a - e ^- puR ^ k) ■ e (-PUR c / P hotoinhib) (7a) 



/ {p URe . e (1.089-l.l2-lo S10 (ik max ))V 

%k = iK max /[ a + 2.50 • eV >\ 



(7b) 



where a is a fitting parameter, PU R c and PUR e are the mean daily photosy- 
thetically usable radiation (/imol photons mT 2 s _1 ) in the center of the mixed 
ocean layer and at the mixed layer edges, respectively, photoinhib represents 
the negative impact of high light on phytoplankton growth, and ik max is the 
upper limit of ik. Nitrate (nJim) and iron (feJim) limitation are modeled 
with Monod functions, giving 

nMm = NO, 

N0 3 + k N03 
Fe 

feJim = — ■ - — , (8b) 
Fe + k Fe 

where k^Oz and k Fe are the half-saturation constants for nitrate and iron, 
respectively. 

Grazing assumes a Holling Type II relationship, 

g = (9a) 

V + dcap 

i] = P - (biomin + g Hm ), (9b) 
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where g max is the maximum grazing rate of Z on P, g cap is the capture rate, 
and 7] represents the phytoplankton concentration susceptible to grazing. Here, 
biomin is a modeling measure to ensure that phytoplankton concentrations 
remain positive, and gu m represents the biological limitation of zooplankton 
grazing on phytoplankton. 

Representing the model as a system of differential and algebraic equations 
has advantages, in that it clearly identifies the model variables and the full 
equations such that translating them into computer code for simulation would 
be relatively trivial. The differential equations imply the entities Phytoplank- 
ton, Zooplankton, Detritus, Nitrogen, and Iron. Parameters associated with 
an entity have its state-variable as a subscript. Although the entities are im- 
plicit, the identities of the causal processes are obscured. Close inspection of 
the equations reveals that certain elements correspond to specific physical, 
chemical, and biological processes. Unfortunately, no specific markers for the 
processes exist, and in some cases the processes include elements from multiple 
equations. 



3.2 Process Modeling Representation 

Here, we illustrate the process modeling approach and provide a corresponding 
representation of the Ross Sea ecosystem. We designed the modeling procedure 
to facilitate the construction of new models, but the task here differs some- 
what in that we are translating an existing model. This presents additional 
challenges that highlight important features of the approach. 

As discussed in Section 2.3, the first step of the process modeling procedure 
is to create or to select a library of entity types and generic processes. In this 
case, we need to generate a new library; therefore, we must identify the entities, 
entity types, processes and generic processes we expect to be significant in the 
model. Here, we infer these items from the existing equations. We conclude 
this section by demonstrating how to build the instantiated model of the Ross 
Sea ecosystem using the library. 

3.2.1 Entities 

As already stated, entities are implicit in the original Ross Sea model; We 
can infer them from the equations by examining the variables and parame- 
ters. Each of the variables in the left hand side of the differential equations 
(1-5) describe separate entities that are associated with parameters through 
subscripts. In addition, we associate the driving variables (PUR e , PUR C and 
Th 2 o) with a separate entity we label Environment. Further, we assign the 
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variables characterizing phytoplankton growth rate (/i) and light and nutrient 
growth rate limitations to the phytoplankton entity. 

The grazing rate (g) presents complications. It forms from the interaction 
of two entities, and it does not necessarily belong to either one. We might 
associate it with a grazing process, but this is problematic. We show later that 
we need this variable to operate in multiple processes, but in our framework 
variables and parameters associated with a process cannot be accessed by 
another process. We could create a new entity to house this variable, but for 
now we associate it with Environment. 

The next task is to determine if these entities are unique types, or if one or 
more might be derived from the same general class. To make this decision, we 
examined the parameters associated with each entity along with the roles the 
entities play in the model. Since nitrate and iron are both nutrients and operate 
similarly in the model, we decided to create generic entity called Nutrient. The 
remaining entities have unique types. 

Tables 4 and 5 define the entity types (Phytoplankton, Zooplankton, Detritus, 
Nutrient, and Environment) using the process modeling formalism. Presently, 
Phytoplankton and Environment are the only entity types with more than 
one variable, which reflects the central roles they play in the Ross Sea model. 
In addition to the driving variables and the grazing rate, we associated the 
parameter f3 with Environment. This parameter determines the soluble and 
insoluble fractions of organic matter, and supplants modeling the detailed 
processes of adsorption, flocking, and dissolution. While we designed these 
five entity types to support the Ross Sea model, they could easily be modified 
to support other aquatic ecosystem models. 



3.2.2 Processes 

To generate a generic process library we must know what physical, chemi- 
cal, and biological processes might be relevant in the target model. Again, 
we analyzed equations 1-9 to identify the processes required in our library. 
Our knowledge of the system, aquatic ecosystems in general, and ecological 
theory guided this analysis and the construction of the library. We present the 
resulting generic process library in Tables 6 and 7. 

In our analysis, we found that parameters in the original model guided the 
identification of processes because they were originally selected to represent 
particular aspects of the real system. To use these tags, we fully expanded 
the differential equations and marked equation elements that seemed to cor- 
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Table 4 

Entity types for the Ross Sea process model. 



entity Phytoplankton 

description This entity is a composite of the phytoplankton species, 
variables 

cone {sum}, carbon concentration of phytoplankton 
[X {sum}, realized growth rate 

ikJim {sum}, light limitation of maximum growth rate 
nJim {sum}, nitrogen limitation of maximum growth rate 
feJim {sum}, iron limitation of maximum growth rate 
parameters 

I^Qmax [0,1], maximum unlimited phytoplankton growth rate at 0°C 

e [0,1], exudation rate 

x [0,1], death rate 

ik max [0,oo], upper limit of ik 

ll> [0,1], sinking rate 

biomin [0,oo], minimum carbon concentration 
bioinhib [0,oo], photoinhibition of phytoplankton growth 
4>(N/C) [0,1], ratio of nitrogen to carbon in phytoplankton 
4>{Fe/C) [0,1], ratio of iron to carbon in phytoplankton 

entity Zooplankton 

description This entity is a composite of the zooplankton species 
variables 

cone {sum}, carbon concentration of zooplankton 
parameters 

7 [0,1], assimilation efficiency 

x [0,1], death rate 

r [0,1], respiration rate 

uj [0,1], sinking rate 

9max [0,1], maximum grazing rate 

9lim 

[0,oo], grazing limitation 
g cap [0,oo], grazing capacity 

respond to processes. For example, we rewrote and labeled equation 1 as 
dP 

— = fiP - e/iP -xpP^- ujpP- gZ^ . (10) 

Growth Exudation Death Sinking Grazing 



We initially identified five processes in this equation: growth, exudation, death, 
sinking, and grazing. However, growth and exudation are not independent 
in the equation formulation because exudation is a constant proportion of 
the amount of new growth (/iP). When we applied the same procedure to 
the remaining differential equations we found that what we initially identi- 
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Table 5 

Entity types for the Ross Sea process model (continued). 



entity Detritus 
description suspended particulate organic matter 
variables 

cone {sum}, carbon concentration of detritus 
parameters 

r [0,1], remineralization rate of detritus 
uj [0,1], sinking rate 

4>{N/C) [0)1]) ratio of nitrogen to carbon in detritus 

entity Nutrient 
description dissolved nutrients in the water column 
variables 

cone {sum}, dissolved nitrate concentration 

entity Environment 

description the system environment 
variables 

Th 2 o {sum}, sea water temperature 

PUR C {sum}, average daily light intensity at the center of the modeled volume 
PUR e {sum}, average daily light intensity at the top edge of the modeled 
volume 

g {sum}, rate of zooplankton grazing on phytoplankton 
parameters 

(3 [0,1], proportion of matter that becomes soluble 



fled as a nutrient uptake process in equations (4) and (5) (—(p^c/^f^P and 
—(p(p,c/Fe)^P, respectively) was also formulated as a direct proportion of the 
amount grown. Therefore, we treated exudation and nutrient uptake as sub- 
processes of growth, and bundled them into one phytoplankton growth process. 
We made this decision to improve the understandability and causal flow of the 
process model. 

Although we divided the differential equations into different processes, we 
generally mapped the algebraic equations (6-9) into separate processes (Ta- 
ble 7). While bundling the calculation of the grazing rate (9) into the grazing 
process, and the equations for growth rate (6) with its nutrient and light limi- 
tations (7a, 8a, 8b) is conceptually reasonable, we chose to disaggregate these 
processes so that we could track them individually. In addition, segregating 
these limitation processes lets us easily construct alternative processes with 
a different formulation. For example, we could construct an alternative light 
limitation process that does not include the photoinhibition term. In total, we 
encoded 11 generic processes that could operate in the Ross Sea ecosystem 
model. 
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Table 6 

Generic process library used to construct a revised Ross Sea ecosystem model. 



library RossSea 

generic process growth_exp 
relates P {Phytoplankton}, NOz{Nutrient}, Fe{Nutrient} 
equations d[conc.P, t, 1] = (1 — e.P) * fi.P * conc.P 

d[conc.NOs,t, 1] = — 1 * 4>(n/c)-P * fJ-P * conc.P 

d[conc.Fe, t, 1] = — 1 * <fr(Fe/c)-P * h-P * conc.P 

generic process death_exp 
relates S{Phytoplankton, Zooplankton}, D{Detritus}, E {Environment} 
equations d[conc.S, t,l] = — 1 * x.S * conc.S 

d[conc.D,t, 1] = (1 — @.E)x.S * conc.S 

generic process grazing 
relates P{Phytoplankton}, Z {Z ooplankton} , D{Detritus}, E '{Environment} 
equations d[conc.P, t, 1] = — 1 * g.E * conc.Z 

d[Z, t, 1] = j.Z * g.E * conc.Z 

d[D, t, 1] = (1 - 0.E) * (1 - j.Z) * g.E * conc.Z 

generic process respiration 

relates Z {Z ooplankton} 

equations d[conc.Z, t, 1] = — 1 * r.Z * conc.Z 
generic process remineralization 

relates D{Detritus}, N{Nutirent} 

equations d[conc.D, t, 1] = — 1 * r.D * conc.D 

d[conc.N,t, 1] = 4>(n/c)-D * r.D * conc.D 
generic process sinking 

relates V '{Phytoplankton, Detritus} 

equations d[conc.V, t, 1] = — 1 * uj.V * conc.V 

generic process growth_rate 
relates E '{Environment}, P {Phytoplankton} 
equations fi.P = (uO m ax-P * exp(0.0633 * Th 2 o-E)) * 
min(ikJim.P, nJim.P, feJim.P) 



The process modeling formalism builds a new layer of information into the 
model that reflects additional modeling decisions regarding entities and pro- 
cesses. The growth process provides a good example. As we described ear- 
lier, we chose to include exudation and nutrient uptake in the phytoplank- 
ton growth process because they were not modeled as independent processes. 
However, this decision obscures the identity of these subprocesses. If the mod- 
eling task required that we maintain their identities, we could define a new 
variable associated with the phytoplankton entity to represent the instanta- 
neous amount of new phytoplankton growth (new -growth. P) . With this new 
variable, we show how we could formulate separate processes for exudation 
and the nutrient uptake in Table 8. This technique of identifying hidden vari- 
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Table 7 

Generic process library used to construct a revised Ross Sea ecosystem model (Con- 
tinued). 

generic process growth_limitation_light_w_photoinhibition 
relates P{Phytoplankton}, E '{Environment} 
parameters a [0,2], fitting parameter 
equations ikJim.P = a — exp(—PUR c .E/ 

(ik max .P/(a+2.50*exp(PUR e .E*exp(1.089-1.12*log w (ik max .P))))))* 
exp(—PU R C .E / photoinhib.P) 

generic process monodJimitation 
variables V {Nutrient] 
parameters k, half-saturation constant 
equations mJim.V = con.V /(conc.V + k) 

generic process grazing_rate 
variables P{Phytoplankton} , Z{Zooplankton}, E '{Environment} 
equations g.E = (g max .Z * (conc.P - (biomin.P + gu m .Z)))/ 
(gca P -Z + (conc.P - (biomin.P + g hm .Z))) 



ables is especially useful when multiple processes that might be considered 
sub-processes impact the product of a process, or when we want a series of 
sub-processes to be explicit. These additional decisions make it a non-trivial 
task to translate from equations to entities and processes or to construct an 
initial library, while assembling the equations from entities and processes is 
simple. 

3.2.3 Instantiated Model 

Given a library of generic entities and processes, we instantiate candidate 
models using two steps. In the first step, we specify the instantiated entities. 

Table 8 

Alternative processes to explicitly model growth, exudation, and nutrient uptake 

process growth_phyto 
equations d[conc.P, t, 1] = \x * conc.P 
new_growth.P = u * conc.P 

process exudation 
equations d[conc.P, t, 1] = — 1 * e.P * new -growth. P 

process growth_nutrient_uptake_N03 
equations d[conc.NOz,t,\\ = —<P(n/c)-P * new-growth.P 

process growth_nutrient_uptake_Fe 
equations d[conc.Fe,t, 1] = —4>(Fe/c)-P * new-growth.P 
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For the Ross Sea model, we instantiated one entity for Phytoplankton (P), 
Zooplankton (Z), Detritus (D), and Environment (E). We also instantiated 
two Nutrient entities named N03 and Fe. In the second step, we bind specific 
instances of generic processes to members of the entity set. Then, using the 
modeling procedure described in Section 2.3, we can consider multiple mod- 
els with different structures. With the library in Tables 6 and 7, one of the 
possible candidate models will have the same structure as the original Ross 
Sea model, while the remaining candidates will have a subset of the original 
model's structure. We could expand the number of candidate models and in- 
crease the number of processes in the model by adding new generic processes 
or alternative formulations of existing ones to the library. 

We present a portion of the the original Ross Sea ecosystem model specified 
with this formalism in Table 9. The instantiated model begins with the model 
name, after which come the instantiated entities and their types. 1 The next 
line declares the observed variables (i.e., those for which data are available). 
The set of instantiated processes follows. The complete instantiated model 
(not shown) contains thirteen processes. Although this view of the model is 
not as concise as the final set of equations, it conveys more information about 
the model structure. 

Figure 1 shows the causal graph of the full Ross Sea process model drawn 
by Prometheus. Model variables are represented as ovals, processes are 
rectangles, and influence is shown by arrows. This diagram shows explic- 
itly which processes and variables are included in the model, how they are 
directly linked, and where indirect connections occur. The ontological com- 
mitments forbid the direct connection of two variables. Instead, the influence 
must be mapped through specific processes. This lets us explicitly map two 
or more minimum causal paths (variablei — > process — > variable-i) between 
two variables. Inspection of the diagram reveals three feedback loops. The 
first, conc.P — > grazing_rate — * g.E — > grazing — > conc.P, shows the density 
dependence of grazing on the phytoplankton. Two additional loops show the 
feedback of phytoplankton growth into itself through the two nutrient entities 
(e.g., growth_Phyto — > conc.N03 — > monod_limitation_nitrate — > nJim.P — > 
growth_rate_phyto — > fi.P — > growth_Phyto). Feedback loops are difficult to 
see in either the equation or process model representations alone, but they 
stand out in the network representation. 

Along with recovering the original model, the process modeling approach lets 
users explore alternative model structures with relative ease. Exhaustively 
searching the library, we would also find many subsets of the original model 
that may fit the observed data. Consider the alternative structure suggested 



In this example we retain the parameter symbols, rather than providing parameter 
values 
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Table 9 

A select portion of the instantiated Ross Sea Ecosystem model represented in the 
process modeling formalism. Parameter values are not shown. 

model RossSea 

entities P{Phytoplankton} , Z {Z ooplankton} , D{Detritus}, N 0%{N itrogen} , 
Fe{Iron}, E '{Environment} 

observable conc.P, conc.NOs 

exogenous T H2 o-E, PUR C .E, PUR e .E 

process growth_phyto 
equations d[conc.P, t, 1] = (1 — e.P) * (i.P * conc.P 

d[conc.NOs,t, 1] = —<j) N / c .P * y,.P * conc.P 
d[conc.Fe, t, 1] = —cj) Fe / c .P * u.P * conc.P 

process death_phyto 
equations d[conc.P,t, 1] = — x.P * conc.P 

d[conc.D, t, 1] = (1 — (3.E) * x.P * conc.P 

process death_zoo 
equations d[conc.Z, t, 1] = — x.Z * conc.Z 

d[conc.D, t, 1] = (1 — (3.E) * x.Z * conc.Z 

process grazing 
equations d[conc.P, t, 1] = — g.E * conc.zoo 

d[conc.Z, t, 1] = j.Z * g.E * conc.Z 

d[conc.D, t, 1] = (1 - (3.E) * (1 - j.Z) * g.E * conc.Z 

process remineralization_iron 
equations d[conc.D, t, 1] = — r.D * conc.D 

d[conc.Fe, t, 1] = (j) Fe / c .D * r.D * conc.D 



by Tagliabue and Arrigoh s (2003) finding that zooplankton may not be a sig- 



nificant factor in the plankton dynamics of the Ross Sea. To explore this 
possibility, we could instantiate a model that excludes the zooplankton entity 
and its related processes. Within our framework, we would then compare the 
performance of this model to the original one to help determine which is most 
appropriate. If we wanted to expand the model so that it differentiates between 
P. antarctica and the diatom phytoplankton, we could simply instantiate the 
phytoplankton entity type twice (e.g., replace P {Phytoplankton} in the entity 
declaration line with phaeo{Phytoplankton} and diatoms {Phytoplankton}) 
and then instantiate the processes associated with phytoplankton a second 
time, assigning one set to each phytoplankton group. If the scientists do not 
believe the same processes impact the two groups, they can search through 
alternative process instantiations. Again, we can add new generic processes 
or alternative formulations of generic processes to the library and implement 
them with little difficulty. The ease with which users can develop and evaluate 
alternative model structures is one of the primary advantages of the process 
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Fig. 1. Graph of the process representation of the Ross Sea ecosystem model. Vari- 
ables are represented as ovals and processes as rectangles. Arrows indicate direct 
causal influence between processes and entity variables. Light arrows indicate an al- 
gebraic relations, while dark arrows signify differential equation-based relationship. 
Observed and driving variables are highlighted with a bold outline. 

model formulation. 



4 Related Research 



The process modeling framework builds upon and relates to ideas already 
present in the literature. Here, we characterize the work's relevance and nov- 
elty. 

Modeling the processes th at generate system dynamics is no t a new idea in 
ecology. In their textbook, Jetrgensen and Bendoricchiol ( 200lj ) make a strong 
commitment to modeling specific processes. In Chapter 3, the authors de- 
scribe several mathematical formulations of physical, ch emical, and biologi- 



cal pr ocesses that commonly occur in ecological models. iGurney and Nisbet 
(1998) also discuss the importance of processes in modeling ecological dy- 
namics, stating "formulation of a dynamic model always starts by identify- 
ing the fundamental processes in the system under investigation." However, 
both texts switch to sets of equations after processes are considered during 
model formulation. An additional emphasis on processes in ecolo gical mod- 
eling appears in the growing use of process-based models (e.g., [B rugnach 



2005 : iGalbraith et all Il98(l iLandsberer and Warms! Il997r. iMcMurtriel 



1985 



an 



Melillo et all Il993t iRevnolds and Fordl . Il999l : ISimioni et all l2000i ). With 
emphasis on processes, t hese model s resem ble those built with the approach 
described in this paper. Brugnach ( 2005i ) shows clearly that process-based 
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models are conceptualized as a network of data flowing between processes 
that operate on the data. However, model actors or entities are not clearly 
delineated in this approach. 



In an effort to create a general theory of modeling and simulation, IZeigler 
( 1974 ) presents a framework that is very similar to the representation we 
describe. He describes models using three elements: components, descriptive 
variables, and component interactions. Components are the parts of the sys- 
tem from which the model is constructed. Variables (and parameters) char- 
acterize the properties of the individual components, and collectively portray 
the system's behavior. Component interactions are the relationships linking 
components. From this description, it is seems that entities in our framework 
captures both Zeigler's components and descriptive variables. Our processes 
are a subset of Zeigler's component interactions. The difference between our 
representation and Zeigler's approach stems from his desire to construct a gen- 
eral theory of modeling. In contrast, we focus on constructing continuous-time, 
simulation models of mechanisms. 



Our co nceptual framework is perhaps most similar to one described bv lMachamer et al 
(2000) m their discussion of mechanisms in biology. They claim that mecha- 
nisms are composed of entities and activities. As in our framework, entities 
are the actors in a system, both creating and being affected by activities, 
which produce change in the system. Their activities seem equivalent to our 
processes, although the authors discuss and apply the framework to scientific 
reasoning in neurobiology and molecular biology. They do not show how it 
relates to the mathematical models that arise in ecology, and they do not 
develop their ideas into a clear working methodology. 



As in the process modeling approach, some ecological research generates and 
tests alternative possibilities. At the broadest level, if we consider each candi- 
date model structure as a hypothesis of the system's structure and function or 
an alternative explanation for observed behavior, then searching the space of 
hypo theses is an extreme form of evaluating multiple alternative hyp otheses 
(e.g.. lCaroenter et all ITool iHilborn and Miange3.ll997l : lLoehlelll987bh . More 
closely related is the work of Jost and Arditil (|2001 ). who manually fit models 
with alternative formulations of the predation process to several predator-prey 
data sets. Their objective was to determine whether predation was best ex- 
plained by predator density-dependent or -independent processes. Although 
they compared alternative processes, they view their modeling approach as 
a statistical fitting and selection of nonlinear models to p opulation data, a 
methodology gaining prom i nence in population ecology (e.g., Carpenter et al. 



1994 iHilborn and Mangel . ll997tUost and Arditll2000Ll200lh . However, their 
approach lacks an explicit notion of modeling as search. 



The most closely related research is the approach to model revision introduced 
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by Reynolds and Ford ( 19991 ). They describe a multi-criteria assessment tech- 
nique for evaluating process-based models that is used in an iterative modeling 
cycle to guide revision of both model structure and parameter selection. Like 
the modeling approach discussed in this paper, the goal is to discover better 
model formulations. Two important differences stem from the fact that their 
approach does not provide a way to specify domain theory a priori. First, their 
procedure only allows revision of an existing model; I t cannot guide search 
for the initial model. Second, the methodology detects and generally locates 
model deficiencies, but it cannot suggest specific model revisions to make. The 
scientist must manually revise the model and then reapply the multi-criteria 
assessment. 



5 Directions for Future Work 



As the Ross Sea model shows, the approach described in this paper can be 
an effective tool for constructing, organizing, and communicating complicated 
models. However, there remain several ways in which the approach can be ad- 
vanced. In particular, we see three ways of extending our approach to improve 
its utility for modeling environmental phenomena. 



The first extension involves implementing a hierarchical organization of both 
entities and processes. Part of the complexity of environmental systems is that 
their components operate and interact with each other at multiple scales of 
space and time. Hierarchical entities and processes are one way to capture this 
complexity that matches how biologist and ecologists tend to think about them 
( Ahl and Alien! Il99fil : lAUen and Starr) . Il982l lO'Neill et aD . Il98fil ). For exam- 
ple, organisms are classified using a taxonomic hierarchy. The White-throated 
Sparrow (Zonotrichia albicollis) is in the family Emberizidae, which i s in the 
class Aves of the Animal kingdom ((National Geographic Society . 19991 ). Mem- 
bers of each taxonomic level share certain properties that are used to classify 
them. We may also use hierarchical entities to encode spatial phenomena. We 
envision using a set of entities to represent different physical locations, where 
each "spatial entity" contains relevant entities and processes for that space. 
For example, in a lake ecosystem model we might represent the areas above 
and below the thermocline as two distinct entities. Finally, hierarchical pro- 
cesses would let us explicitly represent exudation and nutrient uptake in the 
Ross Sea ecosystem model as subprocesses of growth. We have already be- 
gun work on developing ways to represent and use such hierarchical processes 



( Todorovski et all 12005) 



The second extension addresses the criteria for selecting models. Informa- 
tion about ecological systems is typically heterogeneous; ecologists generally 
know more about some parts of the system then others. For example, they 
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may have continuous-time observations for some system variables but only 
know the appropriate ranges or general trajectory shapes for others. Scien- 
tists often ev aluate the pl a usibil ity of simulation models using all of these 
criteria (e.g., Arrigo et al. . 20031 ). At present, Prometheus only provides 
information about standard goodness-of-fit measures for the variables with 
continuous-time data. Although we may use this additional information to 
evaluate the models manually, we may also be able to encode the information 
and use it to guide automated search and selection of appropriate models. 



Finally, this modeling methodology creates new possibilities for analyzing sys- 
tem behavior. Sensitivity analysis is commonly used in ecological modeling to 
determine the effect of varying one or more parameters on the model behav- 
ior. However, we expect that performing a process-level sensitivity analysis 
will lead to increased understanding of system dynamics because it will reveal 
which processes are primarily responsible for dynamics at a given time. This 
information will be useful for guiding additional experimental work, as well 
as assisting environmental assessment and manage ment. Usin g the m ore com- 
mon process-based ecological modeling approach, iBrugnach (2005) provides 
an initial example of how such process sensitivity analysis might work. 



6 Summary 



In this paper, we described a novel method for representing and developing 
simulation models of complex ecological and environmental systems. The rep- 
resentation builds on a two-part ontology that includes entities and processes. 
With this formalism, we represent quantitative models in a way that approx- 
imates how scientists think about systems. We also develop reusable libraries 
of generic entities and processes based on existing ecological knowledge. We 
claim that these libraries link model development to existing theory, facilitate 
model evaluation, and expedite model construction. 

Two features contribute to the novelty of our modeling approach. First, we 
view model construction as search through a space of possible models. Second, 
we instantiate model elements from generic components. The space of theo- 
retically plausible models and the generic components are defined in libraries 
of domain specific entities and processes. We expect this approach to facilitate 
the construction of ecological models both for theoretical development and for 
environmental assessment and management. 
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